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Abstract 

We propose a new type of approximate counting algorithms for the problems of enumerating 
the number of independent sets and proper colorings in low degree graphs with large girth. Our 
algorithms are not based on a commonly used Markov chain technique, but rather are inspired by 
developments in statistical physics in connection with correlation decay properties of Gibbs measures 
and its implications to uniqueness of Gibbs measures on infinite trees, reconstruction problems and 
local weak convergence methods. 

On a negative side, our algorithms provide e-approximations only to the logarithms of the size 
of a feasible set (also known as free energy in statistical physics). But on the positive side, our 
approach provides deterministic as opposed to probabilistic guarantee on approximations. Moreover, 
for some regular graphs we obtain explicit values for the counting problem. For example, we show 
that every 4-regular n-node graph with large girth has approximately (1.494 . . .)" independent sets, 
and in every r-regular graph with n nodes and large girth the number of q > r + 1-proper colorings 
is approximately [q{l — |)^]", for large n. In statistical physics terminology, we compute explicitly 
the limit of the log-partition function. We extend our results to random regular graphs. Our explicit 
results would be hard to derive via the Markov chain method. 

1 Introduction 

Counting is a natural counterpart to a combinatorial optimization problem. The typical set up involves 
counting the number of feasible solutions to some combinatorially constrained problem. The most widely 
studied such problems involve counting the number of solutions to a bin packing problem |JS97j . counting 
the number of independent sets (also known as hard-core model in statistical physics) |LV97j . |DGJ04j . 
matchings |JS97j . proper colorings in graphs (Potts model in statistical physics) |DG J04j . |DFHV04] . 
volume of a convex body |DaRK9l| . |KLS97] . |LV03j . permanent of a matrix (counting the number of 
full matchings of a bi-partite graph) |Va,]79j . [jF^ . |.ISVn4j . [jSHZl, IHSVVj etc. Typically the set 
of feasible solutions is exponentially large and exhaustive search is computationally prohibited. This 
complexity appears to be fundamentally unavoidable, Valiant |Val79j . Modulo a complexity theoretic 
conjecture, the problems in do not admit polynomial time algorithms, and thus research focused 
on approximation algorithms. Here the most powerful method comes from the theory of rapidly mixing 
Markov chains. The typical setup involves relating counting problem to a sampling problem via certain 
telescoping trick (see for example identity below) and then computing some marginal probabilities 
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using sampling technique. The main technical challenge is establishing that the underlying Markov chain 
mixes in polynomial time (rapid mixing). The scope of Markov chains for which rapid mixing has been 
established includes such notable breakthrough results as Jerrum and Sinclair's |JS89^, and Jerrum, 
Sinclair and Vigoda's |JSV04j proof of rapid mixing of a Markov chain related to permanents, and 
Dyer, Frieze and Kannan |DaRK9l] proof of rapid mixing of a Markov chain related to computing the 
volume of a convex body. Subsequent improvements in running time for computing volumes have been 
established in Kannan, Lovasz and Simonovits |KLS97j and Lovasz and Vempala |LV03j . Somewhat 
closer to the topic of this paper, Luby and Vigoda |LV97j showed that a Markov chain related to 
counting independent sets is rapidly mixing, when the underlying graph has degree at most 4. 

A natural extension of the counting problem is (exponentially) weighted counting, that is computing 
the partition function. Partition function is a fundamental object in statistical physics and thus the 
connection between the counting and statistical physics is well known. There are many results in 
statistical physics literature on computing partition functions in various statistical physics models, but 
unfortunately, most of these results are not rigorous and involve what is known as replica-symmetry and 
replica symmetry breaking cavity method also known as replica symmetry breaking Ansatz |MP V87j . 
The process of rigorization of these spectacular but unproven results by physicists was undertaken 
relatively recently in mathematics: Talgrand |Tal03j proved the validity of the Parisi formula for the 
partition function limit of a Sherrington-Kirpatrick's model. Also Talagrand ITalOl' proved the existence 
and showed a method for computing the partition function limit of a random K-SAT problem in an 
appropriately defined high temperature regime. However, the process of building a full mathematical 
picture of the cavity and replica-symmetry methods is still largely under way. 

In this paper we propose new methods for counting the number of independent sets and colorings 
(computing the partition function) in low degree graphs with large girth. In particular we propose 
a simple polynomial time algorithm for computing approximately the number of independent sets in 
graphs with maximum degree < 4 and large girth. Similarly, for every q we propose a simple computable 
expression for the number of proper g-colorings of any graph with maximum degree r < q — 1 and large 
girth. 

On a negative side our algorithms only approximate exponents of the partition function: for every 
e > we compute e-approximation of the log-partition function (free energy). Also our computation 
time, while polynomial in the size of the graph, is not polynomial in e. Thus our algorithm is PAS 
(Polynomial Time Approximation Scheme) as opposed to FPRAS (Fully Polynomial Time Randomized 
Approximation Scheme) as is typically established using Markov chains method. But there are two 
crucial advantages to our method. First, our algorithms are deterministic and do not suffer from 
sampling error. Second, in special cases involving regular graphs we obtain the values of the partition 
function explicitly. For example we show that in every 4-regular graph with n nodes and large girth, 
the number of independent sets is approximately (1.494. . .)"' irrespectively of the graph! Precisely, we 
show that the logarithm of the number of independent sets divided by n approaches log(1.494. . .) as 
girth increases. The class of regular graphs with large girth is very rich and the fact that the number 
of independent sets is the same in all of them is an interesting by-product of our analysis. The value 
1.494 ... is a numeric approximation of a solution to a certain fixed-point equation. We obtain similar 
limiting numeric values for the case of r-regular graphs when r = 2, 3, 4, 5. For the problem of counting 
the number of proper colorings, we show that for every constant q > r + 1, the number of q colorings in 



our results allow both q and r to be arbitrarily small. All of the known results for counting which are 
based on Markov chain method require q/r to be at least a large positive constant 'D FHV04] . 

The main technical approach underlying our results is the progress in understanding properties of 
Gibbs distributions on regular infinite trees for independent sets, coloring, Ising and some other related 



every r-regular graphs with large girth is approximately 
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models in the context of correlation decay and the connection of thereof to the uniqueness of Gibbs 
measure. We use this stream of work to propose a different method for computing marginal probability 
featuring in cavity equation below. In one of the earliest results in this area, Kelly Kel85j established 
the following phase transition property for independent set on infinite r-regular trees: the probability 
that a root of the tree belongs to an independent set selected according to the Gibbs measure is 
asymptotically independent from the finite depth boundary of a tree, provided that inverse temperature 
A is sufficiently small. The "counting" case A = 1 satisfies this condition for r < 5 but breaks down 
for larger r. A recent extension of this result to general Galton- Watson type random trees and Erdos- 
Renyie type random graphs was done by Bandyopadhyay |Banj . Similar uniqueness property is also 
known for Ising model ' Geo88j and recently was established for coloring in the case of g > r + 1 colors 
by Jonasson j.Tonfl2 . closing an open problem posed earlier by Brightwell and Winkler |BWn2j . The 
correlation decay property (long-range independence) featured lately very prominently in a variety of 
contexts including Aldous' proof of the (^2-li™it for the random assignment problem |Ald01j . bivariate 
uniqueness and endogeny of recursive distributional equations in Aldous and Bandyopadhyay jABDSj . 
Bandyopadhyay jBan02,, Bandyopadhyay |Ban| . Warren |War05j . the local weak convergence properties 
Aldous and Steele |ASn3j . Gamarnik, Nowicki and Swirscsz |(TNSaj . [GNSbj . Gamarnik |(Tamn4j . and 
the problems of reconstruction on a tree, Mossel |Mos04j . Yet, the importance of the correlation decay 
property for the uniqueness of Gibbs distribution was well recognized long time ago in the fundamental 
works by Dobrushin |Dob70j dating back to 70's. While Dobrushin's work was conducted primarily for 
lattices, there is a recent extension of this work by Weitz |Weir)5j to more general graphs. 

In this paper we establish the correlation decay property for independent sets, similar to the one 
considered by Kelly |Kel85j but for an arbitrary (not necessarily regular) tree with maximum degree at 
most 4. This property coupled with the cavity trick almost immediately leads to a simple algorithm 
for computing approximately the partition function for independent sets. The corresponding algorithm 
for colorings is obtained by a simple extension of the Jonasson's |Jonn2j uniqueness theorem for colorings. 
Methodologically, our approach consists of implementations of the following 3 steps. First computing 
appropriate marginal probabilities on a tree. This step typically involves a very simple recursive type 
computation. Then showing that the boundary has a vanishing impact on this marginally probability 
(correlation decay). Finally, the correlation decay is used to project the results of computation of 
marginal probabilities to non-tree graphs with locally tree-like structure. 

Our explicit results for regular graphs are obtained by explicit computations of marginal prob- 
abilities for regular trees. An additional technical difficulty is the fact that the cavity step "de- 
stroys" the regularity of the graph. A simple trick introduced by Mezard and Parisi |MPf)5j . (see 
also Rivoire et.al |RBMM04] ) fixes this problem via some "rewiring" step. The regime corresponding 
to the correlation-decay property in our sense, is called a liquid phase. Our results then can be viewed 
as a rigorous treatment of liquid phase solution for independent sets model. Thus our work strength- 
ens further an interesting and intriguing connection between the statistical physics and the theory of 
algorithms. 

The rest of the paper is organized as follows. In the following section we provide the necessary 
background and definitions. Main results and their extensions, including the extensions to random 
regular graphs are presented in Section |31 Proofs are derived in Sections I4I5I6I Some conclusions and 
open problems are presented in the Sectional 

2 Notations and basics 

Throughout the paper we consider a simple graph G with the node set V = {vi, . . . , f„} and edge set 
E = {ei, . . . ,em}- We also write n = n{G) = \V\ for the number of nodes in the graph. With some 
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abuse of notation we will be writing v € G, if node v belongs to the node set V of the graph G. For 
every v £ G, r{v) = r{v,G) denotes the degree of v in G. N{v,G) denotes the set of neighbors of v 
in G. The maximum degree and the girth (size of the smallest cycle) of G are denoted by r = r(G) = 
maxi<fc<„ r(ufc) and g = g(G) respectively. Let GQ{n,g,r) be the set of all degree-r graphs G with n 
nodes and girth at least g. Let also Q(n,g,r) be the set of all r-regular graphs G with n nodes and 
girth at least g. Typically, we will be considering graphs with constant r, but girth diverging to infinity 
as a function of n. For every positive integer t and every node Vi, we denote by T{vi,t) the depth-t 
neighborhood of Vi - the set of nodes reachable from Vi by paths of lengths at most t. Clearly g > 2t 
implies that T{vi,t) is a tree for every node Vi. A set I C V is independent (stable) if no two nodes of 
/ share an edge. I = T{G) denotes the set of all independent sets in G. A proper coloring C £ C{q) is 
an assignment C : F — > {1, . . . , g} of nodes V to colors 1,2, ... ,q such that no two nodes which share 
an edge are assigned to the same color. For every q £ N, C{q,G) = C{q) denotes the set of all proper 
colorings of the nodes of G by colors 1,2, ... ,q. Throughout the paper we will only consider the case 
q > r + 1. Then, as is well-known (and straightforward to show), the set C{q) is non-empty. In statistical 
physics literature it is common to call independent sets hard-core model and call colorings g-state Potts 
model |(;eo88j . There is a way of defining a general model which simultaneously includes the model for 
independent sets and colorings by means of graph homomorphisms. This formalism has been used in a 
variety of papers |D(TJfl4j . |BWn4aj . Here, for simplicity we do not resort to this formalism. 

A classical object in statistical physics is Gibbs probability distribution on the sets I,C{q). Fix 
X > 0, Xj,l < j < q called activity parameters. The Gibbs distribution on the set I assigns a probability 
proportional to A'^' to each independent set /. More precisely, 

where / is the random (with respect to Gibbs measure) independent set, and Z{X) = Z{X, G) = 
^j-gjA'^l, the normalizing constant, is called the partition function. A is called inverse temperature 
and the quantity log Z(X) is also called free energy. In order to emphasize the underlying graph, 
sometimes we will denote the Gibbs measure by Pg(')- When A = 1, Z{X,G) = Z{1,G) = \I\ and the 
Gibbs distribution is simply the uniform distribution on the set of all independent sets. 

There exists a way to represent the partition function Z[X, G) in terms of marginals of the Gibbs 
measure in the following sense. Let Gq = G and Gk = G \ {vi, . . . , v^}, k = 1,2, . . . ,n. 



Proposition 1 The following relation holds 

Z{X,Gk) 



Z{X, Gk- 

As a result, 



ZiX,G) = llF^Uvk^I). (2) 



k=l 



This proposition is well known and is used for Markov chain based approximation algorithms for 
counting. We provide the proof for completeness. For convenience we assume that a partition function 
of an empty graph is equal to the unity. 

Proof : The proof is obtained by considering a telescoping product 
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and observing 



Z{X,Gk-i) Z{X,Gk-i) 

■ 

For the case of coloring, the Gibbs distribution on the set C{q) of proper colorings is introduced 
similarly as 

where C is the (Gibbs) random coloring and A = (Ai, . . . , Xq) is a fixed vector of activity parameters, 

Cj = {v ^ V : C{v) = j}, and Z{X) = Z{X,G) = J2c'(^C{q)Y\.i<j<q again the normalizing 

partition function. Again the special case Xj = 1,1 < j < q corresponds to the uniform distribution 
on the set C{q) of proper (^-colorings. In this paper we focus exclusively on this special case and use 
notation Z(q, G) or Z{G) instead. The corresponding analogue of Proposition ^ is somewhat more 
complicated. For a random coloring C selected according to the Gibbs distribution and for any subset 
of nodes A, denote by C{A) the set of colors assigned to A. In particular, C{N{vk,Gk^i)) is the set 
of colors used by coloring C for the neighbors of the node Vk in the graph Gk-i- We will also write 
C{v) for C{{v}) for every node v £ G. Again for convenience we assume that the number of proper 
g-colorings of an empty graph is equal to unity. 



Proposition 2 The following relation holds 

Z{q,Gk-i) 
Z{q,Gk) 

As a result, 



q-¥.G,[\C{N{vk,Gk-i))\\. (3) 



Z{q, G) = \{[q- Eg, [\C{N{vk, Gk-i))\] ] • (4) 
fe=i 

Proof : The second part is obtained again by considering a telescoping product 

l<k<n Z{q,Gk) 



Z{q, G) = ni<fc<n ^z'if'c )^ • prove the first part we observe that 



Z{q,Gk-i)= Yl iq-m)\{C €CiGk):C{Nivk,Gk^i))=m} 

l<m<r(i,fe,Gfe_i) 

where we simply observe that if the coloring C uses m colors for the neighbors of in G^-i then there 
are q — m colors left for Vk itself. Then we divide both parts by Z{q, Gk) and observe that 

^ {C G C(Gfc) : C{N{vk, Gk-i)) = m} 

E ^ wTTm = ¥.G,[\C{N{vu,Gu-i))\]. 



l<m<r(v^,Gk-i) 
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3 Problem formulation and results 



The enumeration (counting) problem we are concerned with in this paper is of computing approxi- 
mately the sizes of the sets X and C(g). Specifically, we are interested in approximating the exponents 
corresponding to the cardinalities of these sets: 

Definition 1 Value a > is defined to he e- approximation of the log-partition function log Z{X, G) if 

logZ(A.G)^^^ logZ(A.C)^ 

n n 

where e > is the error tolerance. 

Given a family of graphs Q, an algorithm A is said to he Polynomial Approximation Scheme (PAS) 
for computing the log-partition function if for every G £ G it produces an e- approximation of log Z{G) 
in time which is polynomial in n. 

The Markov chain based approach for solving the counting problems typically provides approximation 
for the partition function itself and not just a logarithm of the partition function (as our approach 
does). Also it typically runs in time which is also polynomial in e~^. Thus it is called Fully Polynomial 
Randomized Approximation Scheme (FPRAS). On the other hand it provides approximation only with 
some probabilistic guarantee. We stress that the algorithms proposed in this paper provide deterministic 
guarantee, and thus are PAS, albeit the dependence on e can be exponential. A natural intersection of 
two classes is Fully Polynomial Approximation Scheme (FPAS). The difference between different types 
of approximations is non-trivial and is not fully understood. For example, it is yet not clear that FPAS 
is always possible whenever FPRAS is possible. In fact Dyer, Goldberg and Jerrum |DGJ04j provide 
an evidence to the contrary. 

An (infinite) family of graphs Q is defined to have large girth if there exists an increasing function 
/ : N — > N such that lim^^oo f{s) = oo and for every G £ G with n nodes 

9{G) > f{n). 

3.1 Counting independent sets and colorings 

Our first result establishes existence of PAS for computing the logarithm of the number of independent 
sets in graphs. 

Theorem 1 For every family Q of graphs G with maximum degree r < 4 and large girth, the prohlem 
of computing log Z(A, G) when \ = 1 is PAS. 

We have noted in the introduction that a Markov chain based FPRAS has been established by Luby 
and Vigoda |LV97j for all graphs with maximum degree at most 4. We do not know whether these 
apparently similar restrictions are merely a coincidence or not. 

Our corresponding result for counting proper colorings does not require any upper bound on the 
maximum degree. Also it is more explicit and its algorithmic implication is immediate. In Sectional 
we do though describe an algorithm for completeness. 

Theorem 2 Given constants g > r + 1, the numher of q- coloring of graphs G € Qo{n,g,r) satisfies 



lim sup 

^^°°Gego{n,9,r) 



log Z{q,G) _1 ^ log _ 



l<k<n ^ 



0. 



In particular, for every family Q of graphs G with maximum degree r and large girth, the prohlem of 
computing log Z{q,G) is PAS. 
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Note that the bound in theorem above does not put any lower bound restriction on the number of nodes 
n. This is because the quahty of approximation is completely controlled by the girth size. Implicitly, 
however, there is a trivial restriction, since when n < g, the graph has in fact infinite girth, namely, it 
is a tree. In this case, it can be verified directly, that the expression Z is exact number of colorings. 

Our next results provide explicit estimates for the cardinality of the number of independent sets I 
and colorings C{q) in the special case of regular graphs with high girth. 

Theorem 3 Suppose A < (r — l)'"^^/(r — 2)*". Then the partition function Z{X,G) corresponding to 
independent sets satisfies 



lim sup 



l0gZ(A,G) , , ^ r-2, 

^ ^ - -log(x-2(2-x)"~) 



n 



0. 



When r = 2, 3, 4, 5 and A = 1, the corresponding limits for n ^ log are respectively, 

log 1.618 . . . , log 1.545 . . . , log 1.494 ... and log 1.453 .... 

Remarks : One important corollary of this result is that the asymptotic value of the log-partition 
function (limit of free energy) is the same for every r-regular graph with large girth. In particular, 
this result validates the non-rigorous statistical physics approach for computing free energy, where only 
locally-tree like structure and regularity is used in computation of free energy. Such insensitivity result 
cannot be obtained by the Markov Chain sampling technique. 

We now state our main results for coloring. As we already mentioned, we only consider the special 
case Aj = 1, 1 < j < that is the problem of counting the number of colorings. The reason for this 
limitation will be apparent when we discuss the recent result by Jonasson |.Tonn2j . 



Theorem 4 For every q>r + l, the number of q- colorings of graphs G £ Q{n,g,r) satisfies 

-- 0. 



lim sup 



logZ(,,G)_j^g 



n 



1 r 
fl--)^ 



As an immediate corollary of Theorem |2 we obtain that for every constant a > 1, the number of 
q = [ar\ + 1 colorings of graphs G G Q{n,g,r) is approximately (ge~2^)" as g,r — > oo. Recently 
Bezakova, et.al |BSV Vj obtained the following lower bound on \C{q, G)\ in arbitrary n-node graph with 
maximum degree r: \C{q,G)\ > (g — r(l — e~"^))". Thus, when r is large and q = ar for some constant a, 
their bound becomes approximately {q{l — a^^ + (ae)~^)". It is not hard to see that our lower bound is 
strictly superior. For example, when a = 1, their bound gives approximately {qe~^)^ colorings, whereas, 
per our result, the correct limiting value (in log scale) is (g/y^)". Of course out tight estimate comes 
at a cost of the large girth requirement. 



3.2 Applications to random regular graphs 

Random graphs are obtained by drawing a graph from some family of graphs at random according to 
some (typically uniform) distribution. Specifically, an r-regular n-node random graph Gr{n) is obtained 
by selecting an r-regular graph uniformly at random from the set of all r-regular graphs on n-nodes. 
An important feature of such a regular graph is that the number of small cycles is small. In particular, 
for every constant G the expected number of size-C cycles is 0(1) in terms of the number of nodes 
n, I JLROOj . Thus, essentially such graphs have a large girth and we may expect that our results for 
regular graphs with large girth extend to this class of graphs. It is indeed the case as we state below. 
The derivation of these results is very similar to the one used for the class Q{n,g,r). 
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Theorem 5 For every r and every X < {r — iy ^/{r — 2Y, the (random) partition function Z{X,Gr{n)) 
of a random r -regular graph Gr{n) corresponding to the Gibbs distribution on independent sets satisfies 



l0gZ{X,Gr{n)) rr 

> log [x 2 (2 



r~-2 - 
2 



n 

1^ 



with high probability (w.h.p.), as n ^ oo, where x is the unique positive solution of x = l/^i+Xx"^ ). In 
particular, when r = 2, . . . , 5 and A = 1, log Z{X, Gr{n))/n converges w.h.p. to log 1.618 . . log 1.545 . . 
log 1.494 . . . and log 1.453 . . respectively, as n ^ oo. 

Our corresponding result for colorings is as follows. 

Theorem 6 For every r and every q > r + 1, the (random) partition function Z{q, Gr{n)) of a random 
r -regular graph Gr{n) corresponding to the uniform distribution on proper q-colorings satisfies 



log Z{q, Grin)) 

> log 



q 



n 

w.h.p. as n ^ oo. 

Theorem ini is in fact not new. Using the second moment method it was established in |AMn4j . that that 
logarithm of the number of q colorings of a graph Gr{n) divided by n converges w.h.p. to log [q'(1 — |) ^] , 
matching our expression. In fact the range for q for which this is the case includes q < r. However, 
the (second moment) argument relies strongly on randomness of the graph. We stress that our general 
result Theorem m holds for every regular graph with large girth. 



4 Counting independent sets 

The key method for obtaining the results in this paper is establishing a very strong form of correlation 
decay, appropriately defined. Correlation decay is one of the key concepts in statistical physics which 
has been used to established the uniqueness of Gibbs distribution on infinite graphs (on finite graphs 
Gibbs distribution is unique by definition). These questions of uniqueness and correlation decay have 
been considered primarily in on regular trees. Here we reconstruct some of these results and extend 
them to non-regular trees. A strong form of correlation decay which we will establish will then be used 
to project our results to arbitrary graphs with large girth (and additional restrictions dictated by a 
particular context). 



4.1 Independent sets on trees and correlation decay 

Let T be an arbitrary tree with depth at most t. That is the distance from the root (denoted vq) 
to any other node G T is at most t. Denote by B{T) the boundary of the tree - the set of nodes 
with distance exactly t from the root. Any function b : B{T) — > {0, 1} is called a boundary condition 
b. When B{T) is empty the boundary condition is not defined. We think of boundary condition as 
conditioning on which nodes on the boundary belong to an independent set (corresponding value is 1) 
and which do not (value is zero). In particular, for any boundary condition 6, we denote by ¥{vq G I\b) 
the probability of the event "vq belongs to the random independent set /", conditioned on the event 
{v G B{T) : V £ 1} = {v £ B(T) : b{v) = 1}, with respect to the Gibbs measure. Denote by B(T) the 
set of all boundary conditions b on T, and denote by T(t, r) the set of all trees with maximum degree 
at most r and depth at most t. 

Our first result establishes the key correlation decay property of Gibbs distributions of independent 
sets on trees with maximum degree at most 4. 
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Proposition 3 The following bounds holds for every t > 2, T £ T{t, 4), 6, 6i, 62 € B{T) 

\ < nvo ii\h)< l- (5) 

and 

{vo i I\bi) - F{vo i I\h2)\ < (.9)*-2. (6) 

where P(-) is with respect to the Gibbs distribution with A = 1. 

Moreover, given A satisfying A < (r — ly^^ /{r — 2Y , let x be the unique non-negative solution of 
the equation x = 1/(1 + Xx^~^). Suppose all the nodes of T except for leaves and the root have degree 
r, and suppose the root has degree r — 1. Then for all b G B{T) 

\F{vo i I\b) -x\< a\ (7) 

for some constant a = a(A) < 1. If, on the other hand, all the nodes except for leaves, have degree r 
(including the root), then 

\F{v,iI\b)--^\<a\ (8) 

for the same constant a. 

Remark : The second part of the proposition is a known result estabhshed first in Kelly |Kel85j . 
and we simply refer to Kelly's work for the proof. See also |BW04bj (where w corresponds to 1/x — 1), 
and Bandyopadhyay |Banj where the latter work is concerned with the extension of Kelly's result 
to general Galton- Watson type random trees. The constant a (A) approaches unity as A approaches 
(r — iy~^ /{r — ly and can expressed explicitly, but this is not required for our paper. 

Proof : We fix a tree T G T(t, r) and activity A. Denote by fi, . . . , v^, k < r the neighbors N{v(), T) 
of the root. This includes the possibility /c = (the tree consists of only node vq). For every node v £ T, 
T{v) denotes the subtree rooted at v not containing vq, and b{T{v)) denotes the natural restriction of 
a boundary condition b G B{T) to T{v). For every node v, let T{v\b) be the tree obtained by deleting 
the leaves v' G T{v) which have value b{v') = 1 as well as their parent nodes. Let J = I D T{v\b). It is 
immediate that for every independent set I C T, its Gibbs probability with boundary condition b is 

Pt(/ = I\I n B{T) = b)= Pt(.|6)(/ = J) = ^ 

L^J'eX{T{v\b)) ^' ' 

Using convention Pi<j<fe = 1 when = 0, we obtain 

z{x,T{vo\b)) = ^'"= n ( E ^''')+' n ( E 

Ie2{T{vo\b)) l<j<k IeIiT(vj\b)) l<j<k I&2(T{vj\b)),Vj(f:I 

We recognize that 

ni<i<fc(E/gx(Tfa|fc))Al'l) _ ni<j<fc^(A,r(t;,|fe)) 
Z{\T{vQ\b)) Z{\T{vQ\b)) 

Using the previous expression for Z(A, T(uo|6)), we obtain 
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Note, that similar recursion applies to any node v substituting the root vq by replacing T with T{v). 
Specifically, take any node v which is a parent of a leaf in level t in a main tree T, if any exist. That is 
V is located on level t — 1. It has r{v) — 1 children which we denote by ui, . . . , its children. For 

every child Vj,j < r{v) — 1 (if there are any) the value ¥{vj ^ I\b) is either zero or one depending on 
whether b{vj) = or = 1. The recursive equation (jUJ implies that P7-(^)(^; ^ I\b) £ [(1 + A)~^, 1]. 

Now, suppose that v is any node on level t — 2 and suppose it has r{v) — 1 children. Then applying 
the same recursion and the previously obtained bounds, we get 

<nv^m<--—^-—j-^< 



l + A- ^ ^ ' ^ - l + A(l + A)-K^)+i - l + A(l + A)-'-+i' 

For every node v in level t — 2 define a{v) = 1/(1 + A) and c{v) = 1/(1 + A(l + A)"*"^^) and now we 
obtain bounds on probability ¥(v ^ nodes at lower levels. Given a node v in level r < t — 2, suppose 
P(t> ^ I\b) belongs to an interval [a{v), c{v)]. Then for every node v with children nodes vi, . . . , Vj.(^v)-i 
we obtain 

aiv) = r-Ff < P(u ^ I\b) < --pf -— = c{v). (10) 

l + Ani<,<r(.)-ic(^.) " l + Ani<,<K.)-i«(^j) 

Also, inductively assuming a{vj) > 1/(1 + X),c{vj) < 1/(1 + A(l + A)^'"+^, we obtain by the same 
argument as above that the same bounds hold for a{v), c{v) for all the node v in levels up to t — 2: 

< a{v) < c{v) < , , ^ (11) 



l + A - ' ' - ' ' - 1 + A(l + A)-'-+i 

We note that these bounds only depend on the tree T but not the boundary condition b. We now show 
that , the length of the bounding interval c{v) — a{v) is geometrically decreasing in as a function of the 
level of V in our special case of interest. 



Lemma 1 Suppose r = 4, A = 1. Then for every node v €z T in level r, c{v) — a{v) < (.9) 



t~2-T 



Proof : The proof proceeds by reverse induction in r starting with t = t — 2. For t = t — 2 the bound 
holds trivially from < a{v),c{v) < 1. Assume it holds for levels r + 1, . . . , t — 2 and consider any node 
V in level r with children vi, . . . , Vk,0 < k < r — 1. IfA; = then a{v) = c{v) = 1/(1 + A) and the bound 
holds trivially. Now suppose k > 0. Introduce function / : [(1 + A)-\ (1 + A(l + Xy+'^y^f R 
given by f{z) = /(zi, ...,Zk) = {l + ^^UiKjKk ■ We rewrite JTUl) as f{c{vi), . . .,c{vk)) = a{v) < 
c{v) = f{a{vi), . . . ,a{vk)), where a{vj),c{vj) satisfy the bounds in (fTT|) . Function / is differentiable on 
its domain. By mean value theorem, there exists z G [(1 + A)~^, (1 + A(l + A)^'"^"'^)^"'^]*'' such that 

c{v) - a(v) = Vf{z){a{vi) - c{vi), a{vk) - c{vk)) 



< l|V/(z)||i max \a{vj) - c{vj)\ 

< l|V/(z)||i.9 



t-2-T+l 



where the last bound follows from the inductive assumption. It then suffices to prove that || V/(z) || i < .9. 
We expand ||V/(2;)||i as 



-1 



We now resort to our specific assumption r < 4, A = 1. The remainder of the proof is computer assisted. 
For given A: < 4, consider a resolution .001 grid on the rectangle [(1 + A)^-*^, (1 + A(l + A)^''^^)^^]''', 1 < 
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A: < 4. We note that the right end (1 + A(l + A) ^ of the rectangle is largest when r = 4, so 

we consider the set of vectors z = (zi,. . . ,Zk) of the form Zj = .OOlrrij, for some mj £ N such that 
1/2 = (1 + A)"^ < Zj < (1 + for ah j. We have checked numerically using MATLAB that 

for every k = 2,3,4 and every point z on this A;-dimensional grid, the value of ||V/(2;)||i is at most 
.8736. Specifically, the maximum values for k = 2,3,4 (using rational computations) turn out to be 
1089/2500 PS .4356, 109/165 ^ .6606, 825/943 ^ .8749, respectively We now use first order Taylor 
approximation to argue that the maximums maxV/(z) over the domain of / are at most .9 for all 
k = 2,3,4. For every z in the rectangle find any of its grid point approximation z = 
meaning \zj — Zj\ < .001 (typically many such approximations exist and we choose any of them). Let 
9 — 11^/ 111- We now show that for every two vectors z^, z^ which coincide in all the coordinates except 
for one, and such that ||2:^ — -^^H < -001, we have 

|<7(zi) - 5(^2)1 < .013. (12) 

This results in |/(-z) — f{z)\ < -OlSk < .013 • 3 < .039 and, combining with the bound on points on the 
grid we obtain that for every point z on the domain V/(z) < .8749 + .039 < .9 and the proof of the 
lemma would be complete. 

To estimate the difference \g{z^) — g{z'^)\ we assume, w.l.g. that the two vectors differ in the first 
variable zi. Applying second order Taylor expansion for the first variable zi we obtain that for some 
value 6 between zJ and zf, 

= + ^(.? - .!) + - 4 f (13) 

For convenience, denote generically n2<i<A: by A, n2<j<fc T,2<j<k ^J^ by B, and n2<i<fc by C. 



Trivially, we have A < 1, B < k - I < 2,C < 1. We have g{z) = 



Bzi+C 



and 



dg{z) _ B{1 + AziY - 2{Bzi + C)(l + Azi)A 
dzi ~ (1 + Azi)4 

_ B{1 + Azi) -2{Bzi + C)A 

~ (1 + Azi)3 

Which in absolute value does not exceed max{B/{l + Af,2{B + C)A/{l+Af)) < max{B, 2{B + C)A) < 
6, using the bounds on A,B,C and 1 + Azi > 1,0 < zi < 1. Then the absolute value of the second 
term in the sum in (|13() is bounded by 6 • .001 = .012. We now bound the term corresponding to the 
second derivative, which find to be 



Q^g^^-^ BA{1 + AziY - 2BA{1 + AziY - [B{1 + Azi) - 2{Bzi + C)Aj3{l + AziYA 
" {l + Azif ■ 

We very crudely upper bound the absolute value of ^^f^^-* as 

BA + 2BA + {B + 2{B + C)A){'iA) < 2 + 4 + (2 + 6)3 = 24, 

again using the bounds A < 1,B < 2,C < 1,0 < zi < 1, Azi + 1 > 1. Thus the third term in the sum 
((T^ is upper bounded by (1/2)12 • .001^ = 6 • 10"'*. Combining, we obtain from (fT^ and the obtained 
bounds on the first and second derivative, that \g{z^) — giz'^)\ < -012 + 6 • 10"'* < .013. We established 
(|12|) . This completes the proof of the lemma. ■ 
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Application of the lemma to the root node vq yields, c{vq) — a{vQ) < (.9)* ^. Combining this with 
(jTU]) applied to vq gives for every two boundary conditions 61, &2 



(vo ^ I\h) - Fivo i I\h2) < civo) - aivo) < (.9)'-^ 

This establishes (jSJ and completes the proof the first part of the proposition. 

The second part of the proposition is the result already established by Kelly |Kel85j and we simply 
refer to his paper. ■ 



4.2 Algorithm and the proof of Theorem [T] 

Proposition|21establishes the key correlation decay property for independent sets for trees with maximum 
degree at most 4. It shows that the marginal Gibbs probability at the root is asymptotically independent 
from the boundary. Equipped with this result and Proposition ^ we propose the following algorithm 
for estimating the number of independent sets of a given graph G. 

Algorithm CountIND 

INPUT: A graph G with a node set vi,...,Vn and parameter e > 0. 
BEGIN 

1. Compute the girth g{G) . If (.9)^ compute I{G) by exhaustive enumeration. 
Otherwise 

2. Set G' = G, Z = l, t = g{G)/2. 

3. Find any node v £ G' and identify its depth— t neighborhood T(v) — the set of all 
nodes at distance <t from v. 

4. Perform subroutine CountingTREE on T{v) which results in some value p{v) . Set Z 
equal to Zp^^{v). 

5. Set G' = G'\{v} and go to step 3. 
END 

OUTPUT: Z. 

Subroutine CountingTREE 

INPUT: A tree T with an identified root v and depth t. 
BEGIN 

1. Identify the nodes u in level t (if any exist) and set p{u) = 1/2. 
FOR l = t-l,t-2,...,0 

Identify a node u in level / (if any exist). If u has no children, set p{u) = 1/2. 
Otherwise set p{u) = 1/{1+Ylp{ui)) , where the product runs over children Uj of u in level 
^ + 1 and the values p{ui) were obtained in an earlier step. 

END 

OUTPUT: p{v). 

Proof : Proof of Theorem^ We claim that the algorithm CountIND provides PAS. Fix a family of 
graphs Q with maximum degree r < 4 and large girth, a graph G £ G and e > 0. The algorithm first 
checks whether g{G) > 4 + 21og(l/e)/ log(10/9). By definition there exists a finite number of graphs in 
Q with girth < 4 + 2 log(l/e)/ log(10/9) and their corresponding values of T can be found in constant 
time, where the constant depends on e and the growth rate / of girth. 

g(G) r. 

Otherwise the girth satisfies (.9) 2 < e and in the remaining n steps of the algorithm the Gibbs 
marginal probability F{vk G I) is computed with respect to the depth t = g{G)/2 neighborhood T{vk) 
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of the node Vk with respect to the graph Gk-i- By selection of t, T{vk) is a tree (the girth of each 
subgraph Gk-i is triviaUy at least g{G)). Let B{T{vk)) be the boundary of T{vk) and consider the 
graph Gfc-i = {Gk-i \ T{vk)) U B{T{vk)), that is everything but the first t — 1 levels of T{vk)- Every 
independent set / which is a subset of Gk-i induces a boundary condition b = b{I) on T{vk) via 
its intersection with B{T{vk))- Let 60 denote an empty boundary condition on T{vk) (also called free 
boundary). This corresponds to all independent sets / which do not intersect with B{T[vk))- Then with 
respect to the tree T{vk) we have P2-(^^,)(t>fc ^ I\bo) = Fj'(^^^^{vk ^ /)• We have for every independent 
subset / C Gk-i that ^Ck-ii'^k ^ /|/n Gk~i = I) = ^T{vk)i'^k ^ -^|^(-^)) since T{vk) intersects with 
Gfc-i only on B{T{vk))- Proposition |31 implies that 



\t~2 



, , 9(G) 

(.9)— - 



< e. 



M-^)(v^ i I\bo) - ^TMivk i mi)) < (.9)* 
Then by summing over all possible realizations of / we obtain 

\rTi.,){vk^I)-FG,_M^I)\<e. 
The lower bound part of © gives Fx(v^.)ivk ^ -f) > 1/(1 + A) = .5. Then 

\i.,){vk^I)-FG,_Avk^I) 



{Vk 



J-1 

Gk- 



,)(^'fe i I) 



< 



We conclude 



Gl: 



iJvk i /)(1 - 2e) < F-^\,^^{vk il)< FcLi^^fc ^ + 



The value Fj}^^^^{vk ^ is what algorithm CountTREE outputs as p ^{v). Therefore, applying Propo- 
sition ^ we have that Z, the product of these outputs satisfies 

n n 

Z(l, G)(l - 26)" = n ^cl^S'^k im- 2e)" < Z < n IP5L,(^^ ^ + 26)" = Z{1, G){1 - 2e)\ 

k=l k=l 

Using I log(l — 2e)| < 3e for sufficiently small e, we obtain 

logZ logZ(l,G) 



n 



n 



< 3e. 



Finally, we observe that since, by bounds (|1H) each element of the product Z belongs to the interval 
[1 + A(l + A)-''+\ (A + 1)/A] = [9/8,2/1], then logZ/n > log(9/8). Therefore 



(l-3elog-i(9/8)) < 



logZ 



< (l + 3elog-i(9/8)). 



logZ(l,G) 

Thus the algorithm CountIND is PAS for counting independent sets. ■ 
4.3 Regular graphs and proof of Theorem [31 

The second part of Proposition |31 provides an explicit limiting expression for the probability that a given 
node belongs to an independent set selected according to the Gibbs distribution. In this subsection we 
use it to obtain explicit asymptotics for the logarithm of the number of independent sets in regular 
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Figure 1: Rewiring on nodes vi and V2 



graphs. Theorem ^ provides a way in principle for computing number of independent sets in regular 
graph. The problem is, however, in the fact that the cavity step expressed in (P) destroys regularity: 
when node f i is removed, the remaining graph is no longer regular and it is not clear how to estimate 
product explicitly. The help comes from a trick introduced by Mezard and Parisi fMPDSj, also 
used in |RBMM04] in the context of random regular graph. Given an n-node r-regular G fix any two 
nodes vi,V2 which are not neighbors, and do not have common neighbors (if there are any) and denote 
their non-overlapping neighbor sets by vn, . . . ,vir and V21, ■ ■ ■ ,V2r, respectively. Consider a modified 
graph G° obtained by from G by deleting vi,V2 and connecting vij to V2j, j = 1, . . . , r by an edge, see 
Figure 1^31 for an example with r = 3. The resulting graph is r-regular again. We call this operation 
"rewiring" or "rewire" operation. Rewiring was used in |MPn5j and |E,BMMn4j was in a context of 
random regular graphs and was performed on two nodes selected randomly from the graph. The main 
question is whether we can relate the partition functions of the original and modified graphs and whether 
the resulting graph still has a sufficiently large girth, provided the original one does. The first issue has 
been addressed in |RBMM04] and is essentially a simple combination of type ^ arguments. The second 
issue was not addressed in jEBMMfM] in a rigorous way. It was just postulated that the resulting graph 
again has a large girth if the two nodes are selected uniformly at random. 
We begin by addressing the second issue first. 

Lemma 2 Given an n-node r-regular graph G, consider any integer A < g < g{G). The rewiring 
operation can he performed for at least (n/2) — {2g + l)r^^ steps on pairs of nodes which are at least 
2g + 1 distance apart. In every step the resulting graph is r-regular with girth at least g. 

Proof : In every step of the rewiring we delete two nodes in the graph. Thus when (if) we 
performed t < (n/2) — {2g + l)r^^ successful rewiring steps, in the end we obtain a graph with at least 
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n - 2((n/2) - {2g + l)r2f) = 2{2g + l^^s nodes. Suppose in step t < (n/2) - {2g + l)r2f we have a 
graph Gt which is r-regular and has girth at least g. We claim that the diameter of this graph is at least 
2g + l. Indeed, if the diameter is smaller, then for a given node v any other nodes is reachable from v by 



a path with distance at most 2g and the total number of nodes is at most ^ 



0<k<2g • 



< {2g + l)r2ff 



contradiction. Now select any two nodes vi,V2 G Gt which are at the distance equal to the diameter of 
this graph, and thus are at least 2g + 1 edges apart. We already showed that the graph Gt+i obtained 
by rewiring Gt on vi,V2 is r-regular. It remains to show it has a girth at least g. Suppose, for the 
purposes of contradiction, Gt has girth < g — 1 and k >1 out of r newly created edges participate in 
creating a cycle with length <5'— 1. IfA; = l and vij,V2j is the pair creating the unique participating 
edge, then the original distance between vij and V2j was at most 5 — 2 by following a path on the cycle 
which does not use the new edge. But then the distance between vi and V2 is at most g < 2g + 1 - 
contradiction. Suppose there are k > 1 edges which create a cycle with length < g — 1. Then there 
exists a path of length at most {g — l)/k < {g — l)/2 which uses only the original edges (the edges of 
the graph Gt) and connects a pair v, v' of nodes from the set vu, . . . , vir, f2i, . . . , V2r- If the pair is from 
the same set, for example v = vij,v' = vn, then, since these two nodes are connected to vi, we obtain 
a cycle in Gt with length (g — l)/2 + 2 < g - contradiction, since, by assumption g > 3. If these two 
nodes are from different sets, for example v = vij,v' = V21, then we obtain that the distance between vi 
and V2 is at most {g — 1) /2 + 2 < 2g -\- 1 - again contradiction. We conclude that Gt has girth at least 
g as well. ■ 
We now turn to the second problem of estimating the relative change of the partition function after 
rewiring. This relative change is called energy shift in |RBMM04j . First we provide an elementary 
analogue of 

Lemma 3 Given an r-regular graph G, given A > and graph G° obtained from G by rewiring on 
nodes vi,V2 G G, the following relation holds 



Z{\G°) 



IPg(i'1,1'2 i I)'^G\{v^,V2}{^l<3<r{vij ^ I V V2j ^ I)) 



where Vij,j = 1 



Z{X,G) 

,r is the set of neighbors ofvi,i = 1,2 in G. 



Proof : The proof is almost identical to the one of Proposition ^ The partition function Z{X, G°) 
is obtained as a sum AI^I over the set of independent subsets / C V{G), which do not contain vi,V2 and 
which contain at most one of the two nodes vij,V2j for each j = l,2,...,r. H 

We now obtain a very simple limiting expression for the probability in Lemma |31 

Lemma 4 Given r G N, A < (r — lY~^/{r — 2Y and e > 0, there exists a sufficiently large constant 
g = g{r, e. A) such that for every graph G with girth g{G) > g, and for every pair of nodes vi,V2 £ G at 
distance at least 2^ + 1 



FGiivi,V2^I)) 



1 



(2-x) 



and 



where Vij,j = 
X = 1/(1 + Xx' 



'G\{vuV2}{^l<j<r 



{vij i IVv2j i I)) - {2x-x'^y 



(14) 



(15) 



1,... 



is the set of neighbors of Vi in G, 



1,2, and x is the unique solution of 
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Proof : The proof consists of several steps, each ideologically very similar to the one for Theorem^ 
Fix e > and let g = g{e, r, A) be a large value to be specified later. Select a = a{\) is selected as 
in Proposition We consider any r-regular graph with girth at least g and consider any two nodes 
vi,V2 in G at distance at least 2g + 1, if such two nodes exist. Consider depth t = g/2 neighborhoods 
T{vi),T{v2)- By the distance assumption, they do not intersect, and by the girth assumption, each 
neighborhood is a depth-t r-regular tree. First estimate the impact of deleting these nodes vi,V2 from 
G. That is we first take G° = G \ {^i, ^2} and consider Z{X, G \ {vi,V2})/Z{X, G). Then we will take 
G° obtained by rewiring G on vi,V2 and estimate Z{X,G")/Z{X,G \ {^1,^2}). 

Fix any independent set I on G = B{Tivi))U B{T{vi)) U {G\{T{vi)UT(v2))), where B{T) is again 
the boundary of a tree T. Let bi = I Ci B(T{vi)),i = 1,2. Let / be the random independent set in G 
selected according to the Gibbs distribution with parameter A. We have by Gibbs property that 

¥g{vi,V2 iI\Ir\G = I) = ^g{vi ^ I\I n G = I)Fg{v2 ii\ir\G = i) 

= ¥Ti^,,){vi i I\bi)^T{v,){v2 i I\b2) 

From the second part of Proposition 

\^T(v^){vi i I\bi) - < a\ i = l,2, 

which results in 

\FG{vi,V2^I\InG = I)-i-^f\ <a* + a*-^. 

2 — X 2 — X 

By summing over all the realizations of / we also obtain 

2 — X 2 — X 

We take t = g/2 = g{e,r, A) sufficiently large, so that the absolute difference above is at most e (note 
that the choice depends on a which in itself is controlled by A). This concludes the proof of the first 
part. 

Now consider PG°(Ai<j<r(t'ij ^ / V V2j ^ I))- We take depth-(t — 1) neighborhoods of Vij,k = 
1,2, j = l,...,r and again observe that they are all non- intersecting trees because of the girth and 
distance between vi and V2 assumption. By conditioning on the realizations / of a random independent 
set / in Gi = {Gi\[JijT {vij))U (Uij B {T (vij))) , letting bij = Ir\B{T{vij)) and using the same argument 
as above, we obtain 

( ^l<j<r {Vlj V2j ii)\ir\Gi= /) 

= n i^nv,,){vi3 i I\bi,) + IPt(.,,)K- ^ I\b2,) - IPtk,)K- i /|6i,)Pr(.,,)(^2, i I\b2,)) 
i<i<'- 

Again we use bound provided by Proposition |31 

\fT{v,^){vi, G I\bij) - (1 - x)\ < a'-\ i = l,2, j = l,2,...,r. 
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(we recall that each tree T[vij) has depth t — \ and the root Vij of this tree has degree r — 1). We now 
take t = g/2 = g{e, r, A)/2 sufficiently large so that 



IPg? ( Ai<,-<. {vij ^ / V V2j ^ /)| J n Gi = /j - (1 - (1 - xfy 
By summing over all the realizations of / we obtain 

( ^l<j<r {Vlj V2j i I)) - (2x - x'^Y < e. 



< e. 



Proof : Proof of Theorem The proof is obtained by combining the results of Lemmas I2I3I4I 
From the last two lemmas, for every e we can find g = g(e, r, A) sufficiently large so that for every graph 
G with girth at least g + 1 and for every two nodes vi,V2 at distance at least 2g + 1, the graph G" 



obtained from G by rewiring on vi,V2 satisfies, after simplifying (2 
the following bounds. 



'{2x 



to x''(2 



(l-e)x'^(2-xr 



Z{X,G) 



Here we note that in order to combine the individual absolute differences H14|) and (|15() . we need to 
take g = g{e, r, A) which is sufficiently large with taking x into account. But x itself depends only on 
A. Therefore such g indeed exists. By Lemma|21 if the original graph G has n nodes, then the rewiring 
can be performed for at least N = n/2 — G = n/2 — G{g, r) = n/2 — C(e, r. A) steps, and at most n/2 
steps, where constant G = G{g,r) = {2g + l)r^^. Let G* denote the graph obtained from G after 
rewiring steps. Then from the bound above 



(l-e)t-^(x"(2-xr-")t 



< 



Z{X,G*) 
Z{\,G) 



< (l+e)^(x"(2-a;)"-2)i 



Since the number of nodes in G* is at most 2C, then trivially Z{X, G*) < (1 + A)^*^, then we obtain for 
sufficiently large n(e, r, x, G) = n(e, r, A), that for all n > n(e, r, A) 



logZ(A,G) 



n 



log X 2 (2 



< 2e. 



This concludes the proof of the first part of the theorem. 

The case A = 1 corresponds to the counting problem. We check that (r — 1Y~^ /{r — 2Y > 1 only 
for r = 2, 3, 4, 5 and thus for these values we can obtain the asymptotics of the log-partition function, 
and we do so now. 

In the special case r = 2 and A = 1 we find that x = ~ 0.6180, derived from the golden ratio 

equation x = l/{l + x). Thus the total number of independent sets I{G) in every 2-regular graphs with 
large girth is ~ ( ^ ^ )" ^ (1.618 . . .)"■. As a sanity check there is a simple way to check the validity of 



this answer, for example in a special case when the graph is an n-cycle. We note that for every node v 
on a cycle, if it belongs to the independent set, its right-hand side neighbor v' does not, but if v does 
not, then v' either belongs or does not belong to the independent set. It is a simple exercise to see that 
the number of independent sets which can be created on a path of length k starting from v and going 
to the right is 



'1 1 



1 

1 1 



fc-i 
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The growth rate of this expression is determined by the largest eigenvalue of the matrix, which is 
the golden ration value 2/(\/5 — 1). Thus on the path of length n the number of independent sets is 
~ (2/(\/5 — l))""- The number of independent sets on a cycle differs from this only by a constant factor 
(to adjust for a fact that the last node and the first node v do belong to the independent set at the 
same time). 

When r = 3, A = 1, the solution x to the equation x = 1/(1 + x^) is found numerically to be 
X = 0.682 ... . Thus 1{G) for every 3-regular is ~ (1.545 . . .)". When r = 4, A = 1, we find similarly 
that 1{G) for every 4-regular is ~ (1.494 . . and when r = 5 it is ~ (1.453 . . .)'^. This concludes the 
proof of Theorem El H 



5 Counting Colorings 

The general approach for solving the problem of counting the number of proper colorings is the same as 
for independent sets. We establish correlation decay property for arbitrary graphs with bounded degree 
and large girth. We construct an algorithm exploiting this correlation decay. Then we focus on regular 
graphs, where explicit results can be obtained. Unlike the results for independent sets, our results for 
coloring do not have explicit bounds on the degree of the graph. 



5.1 Coloring of trees and correlation decay 

We use the definitions and notations of Subsection 14.11 T, B{T),B{T) denote respectively an arbi- 
trary depth-t tree with maximum degree at most r, the boundary of the tree and the set of boundary 
conditions. The latter, however, is defined as the set of functions b : B(T) — > {1,2, ... ,q} mapping 
nodes to colors. The root of this tree is vq. Similarly to the case of independent set, we use notation 
¥{C{v) = j\b) to indicate probability that the random coloring C assigns color j to the node v £ T, 
subject to the boundary condition b, where probability is with respect to the Gibbs measure, (in this 
case uniform distribution) on the set of all proper colorings. 

We need an analogue of Proposition 13 and in this case we use the following result by Jonas- 
son |.)onn2j . This result was used to establish uniqueness of Gibbs measures for coloring on infinite 
trees, but the main underlying result is a very strong form of correlation decay. (We note that Jonasson 
uses r -|- 1 in place of r for the degree of a tree) . 

Theorem 7 (Jonasson [J on02| .) Suppose q > r + 1. There exists a computable value (5 = f3{r) < 1 
such that for every r -regular tree T with depth t 



sup 

b£B(T) 



F{C{vo)=j\b)-- 

q 



for every j = l,2,...,q. 



This result says that the color received by the root vq is independent from the colors of the boundary 
in a uniform way as a function of the depth. Note that the decay constant /3 does not even depend on 
q provided that q > r + 1. The analysis of the proof in |Jon02j reveals that the same result holds for 
non-regular trees as well. 

Corollary 1 The result of Theorem ^ holds when T is an arbitrary depth-t tree with maximum degree 
r. 



18 



5.2 Algorithm and the proof of Theorem [51 

We propose the following algorithm for estimating the number of g-colorings of a given graph G. 
Algorithm CountCOLOR 



INPUT: A graph G with maximum degree r such that q>r + l, a node set vi, . . . ,Vn, and 
a parameter e > 0. 
BEGIN 

g(G) o 

1. Compute the girth g{G) . If /? 2 >e compute C{G) by exhaustive enumeration. 
Otherwise 

2. Set G' = G, Z = l, t = g{G)/2. 

3. Find any node v G G' and its degree r' = r{v,G) < r. Set Z equal to 



Z[q{l - -f] 

4. Set G' = G'\{v} and go to step 2. 
END 

OUTPUT: Z. 

Proof : Proof of Theorem O The proof is very similar to the one of Theorem ^ Applying 
Proposition |21 we need to estimate in each step of the algorithm the expected value of used colors 
Kq^ [\C{N{vk, . By fixing any boundary condition on depth-i neighborhood of Vk in the graph 

Gk-i the probability of any particular coloring of the nodes in N{vk,Gk-i) is product of individual 
coloring probabilities. Each individual coloring probability is asymptotically 1/q provided t is large 
by Corollary ^ Therefore given a fixed color i < q, the probability that this color was never used 
in coloring nodes N{vk,Gk-i) is asymptotically (1 — l/qY , where r' is the degree of Vk in the graph 
Therefore q — Kq^ [\C{N{vk, Gk-i))\\ is asymptotically q{l — l/qY , provided that t = g{G)/2 is 
sufficiently large. 

The rest of the argument follows the lines the proof of Theorem ^ ■ 



5.3 Regular graphs and proof of Theorem HI 



Our main tool is again rewiring performed on regular graphs with large girth. Given an arbitrary graph 
G and nodes vi,V2 G G such that vi and V2 are not neighbors, and they do not have a common neighbor, 
let G° be obtained from G by rewiring on vi,V2- Proposition [21 already relates the partition function of 
G to the one of G \ {vi, ^2}. We now relate it to the one of G°. Let G' = G \ {^1,^2}. That is G' is G" 
before the pairs vij,V2j are connected. Consider a random uniform g-coloring C selected in G' . The 
lemma below does not rely on assumptions of regularity or the girth size of the underlying graph G. 



Lemma 5 The following relation holds 



Z{q,G) _^G^ 



\C{N{vi,G))\){q-\C{N{v2,G))\) 



where Vij,j = 1 



Ziq, G°) Pg'(C(^1j) / C{v2j), 1 < J < r) 

, r is the set of neighbors ofvi,i = 1,2 in G. 



Proof : Using the same argument as in Proposition [21 we obtain that 
Ziq,G) 



Ziq,G') 



E, 



G" 



{q-\C{N{v,,G))\){q-\C{Niv2,G))\) 
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On the other hand 



Z{q,Go) 



zi^q^G') ^® probabihty that a randomly selected coloring in G' assigns different 
colors to each pair vij,V2j,j = 1,2, ... ,r. Combining, we obtain the result. H 
The following lemma is an analogue of Lemma ^ 

Lemma 6 Given rSN, (7>r + le>0, there exists a sufficiently large constant g = g{r, e) such that 
for every r -regular graph G with girth g{G) > g, for every pair of nodes vi,V2 £ G at distance at least 
2g + l 



E, 



G' 



\C{N{v^,G))\){q-\C{N{v2,G))\) 



2r 



< e. 



Fg>{C{v,j) + C{V2,), 1 < J < r) - )' 



< e. 



(16) 
(17) 



Proof : The proof is very similar to the one of Lemma 0] In the graph G' consider depth-t = g/2 
neighborhoods of nodes Vij. By girth assumptions these neighborhoods are non-intersecting r-regular 
trees Tij, with the exception that the each root Vij has degree r — 1. Fix any collection of colors 



Cij £ {1,2,. . . ,q}, i = 1,2, j = 
are non-intersecting, we obtain 



1,2, 



, r. 



Applying Corollary ^ and using the fact that the tree 



^G'iCivi 



< e. 



(18) 



provided g = g{e, r, q) is sufficiently large. Thus, under Pg/ the random colors {C{vij)} are approximately 
independent and each uniformly distributed on the set of colors {1, 2, . . . , q}. Thus ()16() and (|17() follows 
by choosing e as e/g^ in (|18j) . ■ 



Proof : Proof of Theorem ^ The proof follows the same steps as the proof of Theorem The 
results of Corollary ^ and Lemmas El El El are combined to obtain the limiting expression after the 
cancelation of {^^^Y . ■ 



6 Random regular graphs 

We prove now Theorems I5I6I 

Proof : Proof of Theorem\^ We use the following fact about random regular graphs (see |.TLR,flnj ) : 
given any constant g > the total number of cycles with length < g is w.h.p. at most some constant 
ci = C2{g)- Thus given G = Gr{n) there exists a graph G obtained from G by removing at most 
(1 -|- 2 -|- . . . -|- g)ci{g) = C2{g) edges, such that G has girth at least g. Observe that all but some 
constantly many nodes 03(5) of G have degree r. We now revisit the proof of Lemma 121 and apply the 
rewire operation to G with the following modification. First we observe that the result of the lemma 
still holds when we replace 2(7 -|- 1 by any large constant. Only the size of the remaining constant size 
graph may change. So we take some constant 04(5) instead ol 2g + 1, which is to be specified later. In 
every step if the pair of nodes vi , ^2 at a distance equal to the diameter of the current graph is such that 
vi and V2 have depth-^i neighborhoods which are regular trees, then we rewire on them. Otherwise we 
perform a breadth-first search for nodes v'l and which do. Note that for this purpose it suffices to find 
nodes which are outside of depth-5 -|- 1 neighborhoods of 03(5) nodes which have degree < r. This will 
occur after our breadth-first choice inspects at most C3{g){l -|- r -|- • • • -|- r^+^) nodes. The newly found 
nodes v'i,V2 are at distance which is at least diameter minus C3{g){l + r + ■ ■ ■ + r^~^^). We rewire on 
v'l, v'2. Since their depth-g( neighborhood are regular trees, then using the same argument as for regular 
trees, we obtain that the ration of partition functions is approximately given x~~{2 — x) ~, where 
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the level of approximation is controlled by g. We now select 04(5) = c^{g){l + r + • • • + r^"*"^) and use 
lemm^with 04(3) replacing 2g + \. The rest of the argument is the same as for the case of regular 
graphs. 

Theorem IHl is established in exactly the same manner. ■ 

7 Conclusions 

We have presented in this paper a new method for solving approximately some counting problems, which 
is not based on the Markov Chain sampling technique. We applied our method to independent sets and 
colorings in low degree graphs with large girth. The primary technical tool is a derivation of a certain 
correlation decay property which features prominently in statistical physics literature in connections 
with a completely different topic: uniqueness of Gibbs distributions on infinite trees. We certainly hope 
that our approach is more general and can be applied to other combinatorial problems. This constitutes 
an interesting direction for further research. Another research direction is removing the requirement 
of large girth, and here the difficulty is establishing correlation decay in non-tree like graphs. Such 
correlation decay was already established by Dobrushin IDob7nj back in 70's for lattice like graphs, but 
there is a recent extension by Weitz |Wei05j to a more general graphs. Perhaps this correlation decay 
(long-range independence) can be exploited to obtain non-Markov chain type algorithms for counting 
problems. Finally, it would be interesting to see if our approach can be converted to an algorithm for 
sampling from the uniform distribution, for example of independent set or coloring in the same class 
of low degree graphs with large girth. This would be a nice supplement to the classical approach of 
rapidly mixing Markov chains. 

Acknowledgement. We gratefully acknowledge several fruitful conversations with Marc Mezard, 
Richardo Zecchina and Dimitris Achlioptas. 
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